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2^ This paper formulates a necessary and sufficient condition for a generic 

graph matching problem to be equivalent to the maximum vertex and ed- 
ge weight clique problem in a derived association graph. The consequences 
of this results are threefold: first, the condition is general enough to co- 
ver a broad range of practical graph matching problems; second, a proof to 
establish equivalence between graph matching and clique search reduces to 
showing that a given graph matching problem satisfies the proposed condi- 
tion; and third, the result sets the scene for generic continuous solutions for 
a broad range of graph matching problems. To illustrate the mathematical 
t— l framework, we apply it to a number of graph matching problems, including 

the problem of determining the graph edit distance. 

oo 

^ 1 Introduction 

The poor representational capabilities of feature vectors triggered the field of structural 
<3\ pattern recognition in the early 1970s to focus on pattern analysis tasks, where struc- 

tured data is represented in terms of strings, trees, or graphs. A key issue in structural 
pattern recognition is to measure the proximity of two structural descriptions in terms 
of their structurally consistent or inconsistent parts. There is also strong evidence that 
human cognitive models of comparison and analogy establish relational correspondences 
between structured objects [8119]. The problem of measuring the structural proximity 



of graphs, more generally referred to as the graph matching problem, is often computatio- 
nally inefficient. Therefore, an appropriate formulation of the graph matching problem is 
important to devise efficient solutions and to gain insight into the nature of the problem. 

One popular technique is to transform graph matching to an equivalent clique search 
in a derived auxiliary structure, called association graph 1 1 , 2 , 18j[l5][l9] . Chen & Yun [6] 
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generalized association graph techniques by compiling results from |2,5 12 and showing 
that the maximum common (induced) subgraph problem and its derivations can be 
casted to a maximum clique problem. Pelillo [16] extended this collection by transforming 
the problem of matching free trees^] to the MVCP so that connectivity is preserved. 
Bunke 14] showed that for special cost functions, the error correcting graph matching 
problem and the maximum common subgraph are equivalent. As a consequence, special 
graph edit distances can be computed via clique search in an association graph. Schadler 
& Wysotzki [20] mapped the best monomorphic graph matching problem to a maximum 
weighted clique problem without presenting a sound theoretical justification of their 
transformation. 

Although association graph techniques are attractive, an equivalence relationship bet- 
ween graph matching and clique search has been established only for certain classes 
of graph matching problems. Examples for graph matching problems, which have been 
proven equivalent to clique search in an association graph are the maximum common 
(induced) subgraph and special cases thereof, or the graph and tree edit distance for spe- 
cial cost functions. To utilize the benefits of association graph techniques for a possibly 
broad range of graph matching problems, we pose the following question: 

Under which conditions is graph matching equivalent to clique search in an 
association graph? 



With the above question in mind, we make the following contributions: 

• We formulate a necessary and sufficient condition (C), for graph matching problems 
to be equivalent to the maximum weight clique problem in an association graph. 
As opposed to standard formulations, the maximum weight clique problem takes 
into account weights assigned to both vertices and edges. 

• Condition (C) is sufficiently strict to cover a broad range of graph matching pro- 
blems. Besides the well-known standard problems, we show, for example, that 
determining the graph edit distance is equivalent to the maximum weight clique 
problem in an association graph. 

• The mathematical framework developed in this paper provides a proof technique 
that reduces the problem of showing equivalence between graph matching and 
clique search to the problem of showing that a given graph matching problem 
satisfies condition (C). By means of a number of examples, we illustrate how the 
novel technique considerably simplifies equivalence proofs. 

The paper is organized as follows. We conclude this section by introducing the ter- 
minology. Section 2 presents the background and an intuitive idea of condition (C). In 
Section 3, we formulate condition (C) and prove that (C) is necessary and sufficient 
for the desired equivalence realtionship. We apply (C) in Section 4. Finally, Section p] 
concludes with a summary of the main results and an outlook on further research. 

1 A free tree is a directed acyclic graph without a root. 
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1.1 Preliminaries 



The aim of this subsection is to introduce the terminology and notations used througout 
this contribution. We assume knowledge about basic graph theory. 

Basic Graph Theory 

Graphs For convenience of presentation, all graphs are undirected without loops. The 
graphs we consider are triples X = (V,E,X) consisting of a finite set V = V(X) of 
vertices, a set E = E{X) of edges, and an attributed adjacency matrix X = (x{j) with 
elements Xij from a set of attributes A Li {e}. Elements Xij E A are attributes either 
assigned to vertices if i = j or to edges if (£, j) E E. We label non-edges &E with 
a distinguished void attribute x^ = e. 

Subgraphs We write X' C X to denote that X' is a subgraph of X. For a subset U of 
V(X), the graph X[U] denotes the induced subgraph of X induced by U. 

Items To unclutter the text from tedious case distinctions, we occasionally make use of 
the notion of item. Items of X are elements from I(X) = V xV. Thus, an item £ = (i, j) 
of X is either a vertex if i = j, an edge if % E E(X), or a non-edge if £ €E(X). 

Morphisms Let X and Y be graphs. A morphism from X to Y is a mapping 



A partial morphism from X to Y is a morphism <f> defined on a subset of V(X). By 
T>{(j)) we denote the domain and by H((ft) the range of a partial morphism (j). By abuse 
of notation we occasionally write i = £ D(4>) if £ £ ^(0) an d j £ T>{(j)). The 

meaning of £ £ TZ(<p) is now obvious. A monomorphism is an injective morphism. We 
use the notions homo- and isomorphism as in standard graph theory. A partial subgraph- 
morphism is an isomorphism between subgraphs. 

The Maximum Weight Clique Problem 

Let X = (V,E,X) be a graph with attributes from A = K U {e}. A clique of X is a 
subset C C V such that the induced subgraph X[C] is complete. A clique is said to 
be maximal if C is not contained in any larger clique of X, and maximum if C has 
maximum cardinality of vertices. The weight oo{C) of a clique C of X is defined by 



The weight of a clique C is the total of all vertex and edge weights of the subgraph X[C] 
induced by C. Since the vertices of X[C] are mutually adjacent, the void symbol e does 
not occur in the sum of (TTj) . Hence, u>(C) is well-defined. 



4> ■ V{X) - V(Y) 



% i — t % 




(1) 
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The maximum weight clique problem is a combinatorial optimization problem of the 
form 

maximize <*>(C) = Xij 

i,jec (2) 

subject to C 6 Cx, 

where Cx is the set of all cliques of X. 

A maximal weight clique of X is a clique C G Cx such that 

CCC' ^ w(C) > w(C) 

for all cliques C" of X. It is impossible to enlarge a maximal weight clique C to a clique 
with higher weight. If all vertices and edges of X are associated with positive weights, 
a maximal weight clique is not a proper subset of another clique. A maximum weight 
clique of X is a clique C € Cx with maximum total weight over its vertices and edges. 
By C x we denote the set of all maximal cliques and by C* x the set of all maximum cliques 
of X. 

2 Background 

The aim of this section is twofold: First, it introduces the problem and motivates its 
solution. To this end, we consider the classical maximum common induced subgraph 
problem (MCISP). Second, it provides an intuitive idea of how to solve that problem in 
a more general setting. 

2.1 The Problem 

To set the scene, we consider the MCISP. Given two graphs X and Y, the MCISP asks 
for a partial isomorphism <p : V(X) — ► V(Y) that maximizes the cardinality |P(<^)| of its 
domain. This optimization problem is called MCISP, because each partial isomorphism 
4> between X and Y is an isomorphism between the induced subgraphs X [D (</))] and 
Y\R{<j})\. Since maximizing \V{<f))\ 2 instead of |£>(</>)l does not effect the problem, we 
may rewrite the MCISP as 

maximize f(<p,X,Y) = \T>(4>)\ 2 = ^ 

ijev(<t>) (3) 

subject to 4> £ M., 

where the search space M is the set of all partial isomorphisms from X to Y. The values 
Kijrs £ {0, 1} are the compatibility values with 

K . . — J * : X ij = Vrs (a\ 

Klirs ~ \ : otherwise 1 > 

for all i,j € V(X) and r,s £ V(Y). A compatibility value Kij rs indicates whether item 
(i,j) of X has the same attribute as item (r,s) of Y. The matching objective f counts 
the number of items from X consistently mapped to their exact counterparts in Y. 
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It is well known that the MCISP is NP-complete [7] . Therefore, exact algorithms that 
guarantee to return an optimal solution are useless for all but the smallest graphs. In a 
practical setting, the time required to compute an optimal solution will typically reduce 
the overall utility of the algorthim. It is rather more desirable to trade quality with 
time and to provide near-optimal solutions within an acceptable time limit. Thus, it 
is conducive to consider local optimal solutions of ([3]). We say a partial isomorphism 
4> € «M is a local optimal solution if there is no other partial isomorphism from A4 with 
larger domain. 

To solve the maximum common induced subgraph problem, we transform it to ano- 
ther combinatorial optimization problem using association graph techniques originally 
2j. An association graph Z = X o Y of X and Y is a graph with 
vertex and edge set 

V(Z) = {ir : xu = y rr } Q V(X) x V(Y) 
E{Z) = {(ir,js) : x {j = y rs ) C V(Z) x V(Z). 

The attributed adjacency matrix Z = (zi r j s ) of Z is defined by 

{1 : ir = js and ir £ V{Z) 
1 : ir ^ js and (ij, rs) £ E(Z) 
e : otherwise 

for all ir,js £ V{Z). Thus, we assign to all vertices and edges of Z the weight 1 and to 
all non-edges the void attribute e. Note that we derived the weights Zi r j s by inserting 
the corresponding compatibility values Kij rs . 

By definition of an association graph, we have the following useful equivalence relati- 
onship between between the partial isomorphisms from A4 and the cliques in Z: 

• Z uniquely encodes each partial isomorphism <f> from M as a clique in Z such 
that \V((j))\ = \C\. 

• The set Cz(M) of cliques encoding partial isomorphisms is equal to the set Cz of 
all cliques in Z. 

The above equivalence directly implies that the maximum (maximal) cliques of Z are 
in one-to-one correspondence with the global (local) optimal partial isomorphisms from 
Ai. Hence, solving the MCISP ^ is equivalent to solving the maximum clique problem 
in Z. 

Benefits of the Association Graph Framework 

What makes an association graph formulation of the MCISP so useful is that 

• the maximum clique problem is mathematically well founded, 

• it provides us access to a plethora of clique algorithms to solve the original problem, 

• abstracts from the particularities of the graphs being matched, and 



introduced by (T 14 
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• abstracts from the constraints on feasible matches. 

The benefits of the second item on clique algorithms is worth to be discussed in more 
detail. Both the MCISP and the maximum clique problem are combinatorial problems. 
Solution techniques for combinatorial problems can be classified into two groups: 

• Discrete methods generate a sequence of suboptimal or partial solutions to the 
original problem. This process is guided by local search combined with techniques 
to escape from local optimal solutions. 

• Continuous methods embed the discrete solution space in a larger continuous one. 
Exploiting the topological and geometric properties of the continuous space, the 
algorithm creates a sequence of points that converges to the solution of the original 
problem. 

Discrete approaches suffer from problems having a large number of local maxima. On the 
other hand, continuous approaches such as interior point methods reveal to be efficient 
solutions to large scale combinatorial optimization problems. Hence, using association 
graph techniques provides us access to those efficient continuous solutions. 

Shortcomings of the Association Graph Framework 

Despite its long tradition and benefits, the association graph formulation still suffers 
from the following shortcomings: 

• It is unclear which graph matching problems are equivalent to clique search in 
an association graph. The equivalence relationship in question has been proved 
only for selected problems. For example, Chen & Yun [6j compiled results from 

, showing that the maximum common (induced) subgraph problem and its 
derivations can be casted to a maximum clique problem. Pelillo and his co-workers 
extended this collection for different types of tree matching problems [3j[l8][T7j[T6] . 

• In all these examples, equivalence proofs of graph matching and clique search 
follow the same recurring pattern: First construct an appropriate association graph 
and then establish a bijective mapping between cliques and morphisms. There is 
no mathematical framework that reduces equivalence proofs to showing that the 
preconditions of some generic equivalence relationship between graph matching 
and clique search are satisfied. 
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What we want 

The aim is to remove both shortcomings addressed in the previous paragraph. We want 
to formulate a necessary and sufficient condition (C) for equivalence between graph mat- 
ching and clique search. The existence of condition (C) explains which graph matching 
problems are equivalent to clique search in an association graph and reduces equivalence 
proofs to showing that a given graph matching problem satisfies (C). In addition, we 
want to indicate by a number of examples that a broad range of graph matching pro- 
blems satisfy condition (C) and are therefore equivalent to clique search in an association 
graph. 
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2.2 The Idea 



Our goal is to present an intuitive idea of a necessary and sufficient condition (C) for 
equivalence between graph matching and clique search. To this end, we turn from the 
MCISP to the general case. Suppose that X and Y are two graphs, and let M.xy be the 
set of all partial morphisms from X to Y. The graph matching problem considered here 
generalizes the MCISP in two ways: 

• By allowing nonnegative real- valued compatibility values /%> s = Kji sr - 

• By allowing any subset of M.xy as search space. 

Allowing real-valued compatibility values Kij rs > enables us to measure the degree of 
compatibility, or consistency, between items i = (i,j) of X and r = (r,s) of Y. This 
generalization is useful to cope with noisy attributes. Considering arbitrary subsets of 
■M-XY provides a flexible mechanism to cope with structural errors. 

Applying both generalizations, the graph matching problem (GMP) is a combinatorial 
optimization problem of the form 

maximize X, y) = 

ieDffl (5) 

subject to (j) £ M., 

where the search space Ai is a subset of A4xy, an d = (i i , y) G I{Y) is the image 
item of item i = G I{X). Problem §5§ is a generic formulation that subsumes a 

broad range of practical graph matching problems. 

Often, solving an instance of problem (|5| is computationally intractable. As for the 
MCISP, we therefore consider simpler versions of §5§ that allow local optimal solutions. 
A partial morphism (j) G A4 is a local optimal solution of §5§ if T>{4>) C T>(ip) implies 
f(<P,X,Y)>f(i;,X,Y) for all V G -M- 

Applying the Association Graph Framework 

The central question at issue is: Under which conditions is the generic graph matching 
problem (|5| equivalent to clique search in a derived association graph Z = X o Y? 
Certainly, an association graph Z should satisfy the following properties: 

PI. Z uniquely encodes each partial morphism cfi from the search space M. as a clique 
Cff, in Z such that f{(j>, X, Y) = oo{C), where uj(C) denotes the weight of clique C. 

P2. The set Cz{M) = {C^ G Cz '■ <fi £ «M} °f an cliques encoding partial morphisms 
from M. is equal to the set Cz of all cliques in Z. 

Ad PI: We can always derive an association graph satisfying the first property PI. 
Consider the complete graph Zxy with vertex set Vxy = Y(X) x V(Y). To each item 
((*} j)j ( r > s )) £ !{%Xy), we assign the weight Ky> a . Since is complete, there are only 
vertex and edge items. We extract a subgraph from Zxy to form an association graph 
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Z as follows: For each partial morphism eft G Ai, we construct a complete subgraph 
of Zxy with vertex set V{Z ( j } ) = C$ = {ii^ : i G The vertex set Ca, is a clique in 

Zxy such that /((/>, X, Y) = w(V^). We obtain an association graph Z C Zxy by taking 
the union of all subgraphs Za with <p G A4. Then be construction Z satisfies property 
PI. Note that construction of Z in the above way is impractical, because it requires 
enumeration of all members of Ai. But it provides a simple way to show that there is 
always an association graph satisfying property PI. 

Ad P2: Now let us turn to the second property P2. We first provide a fictitious example 
to show that P2 does generally not hold. 

Example 1. Consider the set Ai 2 of partial morphisms 4> '■ V(X) — * V(Y) that are 
defined on subsets of exactly two vertices from X, i.e. \D{<j))\ = 2 < \X\. Suppose that 
we construct Z as described in the previous paragraph. Then the set Cz{Ai%) of all 
cliques encoding morphisms from Ai 2 are cliques in Z with exactly two vertices. We may 
encounter the following pitfalls: 

PF1. Let C = {i,j} be a clique in Z encoding <f> G Ai 2 - Then {i} and {j} are both 
cliques in Z not contained in Cz{Ai 2 ). 

PF2. Let C,C',C" be cliques from C Z {M 2 ) with C = {i,j}, C = {j,k}, and C" = 
{i,k}. Then C = {i,j,k} is a clique in Z with three vertices and therefore not 
contained in Cz^M.2)- 

Hence, we have Cz{M-2) 7^ Cz and therefore a graph matching problem |5p defined on 
M.2 is not equivalent to clique search in an association graph. 

Example [T] indicates that equivalence between graph matching and clique search de- 
pends on the structure of the search space M.. So our central question of issue reduces 
to a necessary and sufficient condition on Ai such that Cz(Ai) = Cz- 

A Necessary and Sufficient Condition 

The goal is to present an intuitive idea of a necessary and sufficient condition for 
Cz{AA) = Cz- Our claim is that Cz(Ai) = Cz holds whenever there is a property that 
completely describes the set Ai. 

First, we specify the notion of property. Given a graph matching problem, the purpose 
of a property is to describe the characteristics of the search space Ai in such a way 
that we can derive the desired equivalence relationship. For example, let Ai be the 
set of all partial morphisms (f> from AixY that satisfy the property p to be injective. 
Although property p completely describes the characteristics of Ai, it is not suitable to 
deal with it conveniently. The reason is as follows: According to the pitfalls PF1 and 
PF2 of Example [T] we have to show that restrictions and feasible unions of morphisms 
that satisfy property p also satisfy p. This task is unnecessarily complex and can be 
simplified by imposing a locality restriction on the notion of property. 

A property p on a set S is a binary function p : S — > {0, 1}. We say an element x G S 
satisfies property p if p(x) = 1 for each x G S 1 . We can extend the notion of a property 
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to the subsets of S by considering their local behavior. A subset S' C S satisfies property 
p if p(x) = 1 for each element x G S' , i.e. if S' locally satisfies p. 

Next, we characterize an association graph Z = X oY by means of a property. To this 
end, we consider the complete graph Zxy defined on the vertex set Vxy = V(X) x V(Y). 
Let p be a property on the set Ixy = I{Zxy) of items such that the vertices and edges 
of Z are all the items that satisfy p. Hence, the subsets of Ixy that satisfy property p are 
the induced subgraphs of Z induced by its cliques. We can express the set of all cliques 
in Z as Cz = {U C Ixy '■ U satisfies p}. Now using p, we rewrite condition Cz(-M) = Cz 
as follows: There is a property p such that Cz{AA) = {C G Cz '■ C satisfies p}. Since the 
cliques from Cz (Ai) encode the feasible morphisms from Ai, we may directly translate 
the last condition to a condition on the search space Ai: 

(C) There is a property p such that Ai = {(ft G AixY '■ satisfies p}. 

To summarize, we claim that condition (C) is necessary and sufficient for the equivalence 
relationship in question. 

3 A Necessary and Sufficient Condition 

The aim of this section is to prove that condition (C) is necessary and sufficient for 
equivalence between graph matching and clique search in an association graph. 

3.1 Closed Sets of p-Morphisms 

Here, we consider properties p defined on the set Ixy = X I(Y) of pairs of items 

from X and Y. Let (i, r) G Ixy- We say, items i and r are p-similar, written as i ~ p r, 
if (i,r) satisfies property p. Let us consider some examples. 

Example 2. Let (i, r) £ Ixy, o,nd let all morphisms 4> be partial morphisms from AixY- 
Then the following examples are properties on Ixy-' 

1. i ~ p r = there is a partial (mono-, subgraph-, homo-, iso-) morphism <p with 
i+ = j. 

2. i ~ p r = there is a partial connectivity preserving isomorphism <f> with = j. 

Next, we apply the notion of property on Ixy = X I(Y) to partial morphisms. 

To this end, we identify a partial morphism <j) G AixY with the binary relation 

r(0) = {(t,r) G V(<fr) x K(<t>) :i* = r}Q I XY - 

We say, is a p-morphism if T(<f>) is a subset of the set H XY = {(*, r) G Ixy '■ i ~p 
In informal terms, a partial morphism (j) satisfies property p if it locally satisfies p. To 
illustrate the defintion of p-morphism, we provide some examples. 

Example 3. Consider the properties p of Example^ 
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1. Let p be the property of Example^ 1, and let 4> G Mxy be a partial morphism. 
Since = r for all pairs (i,r) G r(<^) ) we trivially have i ~ p r for all pairs 
(i,r) G r(0). Hence, T(cf)) C Tv^y and therefore <fi is a p-morphism. Similarly, 
mono-, homo, and isomorphisms are p-morphisms. 

2. Let p be the property of Example^ 2, and let <f> be a partial connectivity preserving 
morphism on V{X). As in (1), we find that <p is a p-morphism. Note that for 
non-edge items G E{X) the restriction (f>' of (p to {i,j} does not preserve 
connectivity. But what is important is that <p' can be extended to a partial morphism 
4> that preserves connectivity. 

Let M XY C Mxy denote the set of all p-morphisms. We say, a subset M of Mxy is 
p- closed if M = M XY . ^ ne nex t result provides basic p- closed sets. 

Proposition 1. The following subsets of Mxy are p-closed: 



1. 


M = 


set of all partial morphisms 


2. 


M = 


set of all partial monomorphisms 


3. 


M = 


set of all partial homomorphisms 


4- 


M = 


set of all partial isomorphisms 


5. 


M = 


set of all partial subgraph-morphisms 



Proof: We only show the assertion for the set M of all partial isomorphisms . The 
proofs for the other sets are similar. Let X and Y be graphs with adjacency matrices 
X = (xij) and Y = (y rs ). We define a property p on Ixy such that 

K P XY = {(*,r) : Xi = y r ,type(i) = type(r)} C I{X) x I(Y), 

where type(i) maps item i to its type (vertex, edge, or non-edge). 

Next, we show that M = M XY - Let <fi be a partial isomorphism from X to Y, let i 
be an item of X, and let = r be the image of i in Y. Since (f> is a partial isomorphism, 
we have xi = yj. In addition, <f> preserves the type of items i and r. Hence, the set T{(p) 
is a subset of Tt\ Y - This proves M C M XY - 

Now assume that is a p-morphism from M XY . Since T(<f>) is a subset of Tl XY , it 
is sufficient to show that <fi is bijective. Assume that <j> is not bijective. Then cf) is not 
injective, because is a partial morphism. Hence, there are distinct vertices i, j of X 
with v* = y = r. Then <j> maps a non- vertex item (i,j) to a vertex item (r, r), which 
contradicts the condition type(i) = type(r). This proves M XY C M. Combining both 
results yields the assertion. □ 

Though the proof of Proposition [T] is fairly simple, it illustrates the basic approach 
how to show that a certain subset M of Mxy is p-closed. The task is to construct an 
appropriate property p such that we can show M = M XY . 

We conclude this section with a formal proof that the set M2 from Example [4] is not 
p-closed. 
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Abbildung 1: Graphs X and Y with association graph Z = X o Y for the maximum 
common connectivity preserving induced subgraph problem. Different fil- 
lings refer to different vertex attributes. All edges have the same attribute. 
Highlighted edges in X and Y correspond to isomophic subgraphs. Note 
that the highlighted subgraph in X is not induced. Highlighted edges in Z 
refer to the maximum clique. 

Example 4. Consider the set Ai m of partial morphisms 4> : V{X) — > V(Y) with 
\V{(j))\ < m < \X\, where m > 2. There is no property p on Ixy such that M. m is 
p-closed. 

Proof: Let p denote the property on Ixy such that the set of all partial morphisms 
M.xy is p-closed. According to Proposition [TJ such a property exists. Consider the set 
M.2- It is easy to see that 

|J T{4>)=n XY . 

On one hand, the relation 1Z XY 1S t°° large, because it admits arbitrary partial mor- 
phisms as p-morphisms. On the other hand, H X y 1S a m i n i m &l set in the following sense: 
If we remove an element (i, r) from Tt X Y > then the morphism cj) G M2 with = r is no 
longer a p-morphism. This shows the assertion. □ 

3.2 Construction of an Association Graph 

We present the usual constructive definition of an association graph for GMP Q defined 
on a p-closed search space M.. 

For each element (i,r) £ Ixy, we check whether item i from X and item r from 
Y can be associated to a vertex or edge in an association graph. The morphisms from 
the search space Ai determine the association rule via its describing property p. We 
associate item i from X with item r from Y if they are p-similar. Thus, an association 
graph Z = X o Y is a weighted graph with vertex and edge set 

V{Z) = {ir : ~ p (r,r)} C V{X) x V{Y) 
E(Z) = {(ir,js) : (i,j) ~ p (r,s)] C V(Z) x V(Z). 
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The matrix Z = (zi r j s ) with 




e : (ij,rs)eE(Z) 
Kijrs '■ otherwise 



assigns weights to the vertices and edges of Z. Note that an association graph assigns real- 
valued weights to both, its vertices and edges. This is in contrast to standard association 
graph formulations, where Z is either unweighted (constant weights) or weights are 
assigned to vertices only. 

3.3 A Necessary and Sufficient Condition 

In this subsection, we show that p-closure of At is a necessary and sufficient condition 
for the desired equivalence between graph matching and clique search in an association 
graph. Note that equivalence means that there is a bijective mapping $ : Cz —* Ai 
with oj(C) = /(<fr(C)) for all cliques C G Cz- The mapping $ then induces a one-to-one 
correspondence between the maximum (maximal) cliques in Z and the global (local) 
optimal solutions from Ai. 

Theorem 1. Let X and Y be graphs. Then the GMP |5|) of X and Y is equivalent to 
the MWCP in a n-association graph Z = X oY if, and only if, there is a property p on 
Ixy such that the search space Ai of |5|) is the set of all p-morphisms from AixY ■ 

Proof: The =>■ - direction is trivial. Suppose that GMP (|5j is equivalent to the MWCP 
in Z. Then we simply define the property p with i ~ p r if there is a morphism eft G Ai 
with $ = r. 

Now let us show the opposite direction. Suppose that there is a property p on Ixy 
such that Ai is the set of all p-morphisms from AixY- We want to show that there is a 
bijection 

<5>:C Z ^M, C^4> c 

such that uj(C) = f(4>c,X,Y) for all C G Cz- With each clique C G Cz we associate a 
partial morphism (f>c '■ V(X) — ► V(Y) such that <f>c(i) = r f° r all {h r ) £ C. We show 
that 4>c is a feasible morphism from A4. Let i,j G V(X) and r, s G V(Y) be vertices with 
4>c(i) = r and <f>c{j) = s - By construction, (i, r) and (j, s) are members of clique C. Since 
Z[C] is complete, there is an edge incident with (i,r) and (j,s). Hence, (i,r) ~ p (j, s) 
and, therefore, T(4>c) Q K^y. Since A'i is p-closed, we have 4>c G M.. 

Similarly, with each morphism (j) G «M we associate a subset C<i of V(Z) with 

(i, j) G <S> 0(i) = j or 0(j) = i. 

Since is a feasible p-morphism, is a clique in Z. It is straightforward to show that 
both associations give rise to well-defined mappings 

§:C Z ^M, Cv+fa 
*:M^Cz, c^^C^. 
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Tabelle 1: Examples of standard graph matching problems. 



Type 


Graph Matching Problem 


X C .Mxy 


1 


Maximum Common Subgraph Problem 

— > Special case: Subgraph Isomorphism Problem 


partial subgraph-morphisms 
total subgraph-morphisms 


2 


Maximum Common Induced Subgraph Problem 

— > Special case: Induced Subgraph Isomorphism Problem 

— > Special case: Graph Isomorphism Problem 


partial isomorphisms 

total isomorphisms to subgraph of 

total isomorphisms 


3 


Maximum Common Homomorphic Subgraph Problem 
— > Special case: Subgraph Homomorphism Problem 


partial homomorphisms 

total homomorphisms to subgraph 



From $ o $ = id on jM and \& o 3> = id on Cz, it follows that $ is bijective. Finally, the 
assertion follows from 

u(C) = ^ z irjs= ^2 = f(^C,X,Y). 

ir,js£C ieV(<j> c ) 

□ 

3.4 Implications of Theorem 1 

Theorem [T] has the following implications: 

• Equivalence proofs now reduce to showing that the search space M of a GMP is 
p-closed for some property p. 

• Theorem [T] is the starting point for generic continuous solutions to GMPs defined 
on p-closed search spaces. An equivalent continuous formulation of the MWCP is 



given in 10 . 



4 Application of Theorem 1 

The aim of this section is threefold: First, we show that common graph matching pro- 
blems satisfy the necessary and sufficient condition. Second, we want to illustrate how 
the necessary and sufficient condition simplifies equivalence proofs. Third, we show the 
equivalence relationship for further examples, such as the graph edit distance, for which 
the desired relationship has been unproven. 



4.1 Simple Exact Graph Matching Problems 

In this subsection, we consider the problems listed in Table |4.1| For the first two types 
of problems equivalence to clique search is well-known. Homomorphic GMP of type 3 in 
Table 4.1 have been considered in a slight variation for attributed trees pjl. 
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An exact graph matching problem is a GMP ([5]), where the compatibility values of the 
matching objective are of the general form 



ay 

OLE 
a E 





Xi = yj , ieV(X), j€V(Y) 
Xi =_ yj , i £ E(X), j G E{Y) 

ieE(x),jeE(Y) w 

otherwise 



for all items i of X and j of Y. We require that the parameters ay, ag, and a^ are 
nonnegative such that > 0. Thus, exact matching problems only credit exact 

correspondences between items of X and Y . Standard formulations of the problems listed 



in Table 4.1 aim at maximizing the cardinality of vertices of the common substructure. 
Thus, we may set ay = 1 and = as = 0. If we want to maximize the cardinality of 
edges of the common substructure, we may set aE = 1 and ay = a^ = 0. Other choices 
of the parameters ay, aE, and ag- reflect the importance of an exact association between 
corresponding items. 

Different types of exact graph matching problems differ from the definition of the 
search space M as indicated in Table |4.1| Note that the special cases considered here are 
GMPs constrained over subsets of total morphisms. Clearly, subsets of total morphisms 
are not p-closed. Hence, in a strict sense, the special cases are not equivalent to clique 
search. To argue consistently, we regard a special case as an instance of the corresponding 
generic case, where we make an additional decision based on the optimal solution. For 
example, we think of the isomorphism problem as a MCISP, where we decide that both 
graphs under consideration are isomorphic if they have the same cardinality of vertices 
as a maximum clique in a derived association graph. Using this convention, we can show 



that the examples in Table 4.1 are equivalent to clique search in an association graph. 



Corollary 1. Consider the GMP ^ with compatibility values of the form |6]). Then 
the problems listed in Table \4-l\ o,re equivalent to the MWCP in an association graph. 



Proof. From Proposition [T] follows that the sets of partial subgraph-, iso-, and homo- 
morphisms are p-closed. For problems, which are maximized over total morphisms, we 
may relax M. to partial morphisms without affecting the optimal solutions of since 
ay, aE, and a^ are nonnegative with ay + oe + a^ > 0. The assertion follows from 
Theorem [U □ 



4.2 Inexact Graph Matching Problems 

In this subsection, we consider inexact graph matching problems. Again let X and Y be 
graphs with adjacency matrices X = (xij) and Y = (yij). 

Best Common Subgraph Problems 

The best common subgraph problem is a GMP (JHl), where the compatibility values of 
the matching objective are arbitrary real values. The search space A4 is either the set 
of all partial morphisms or the set of all partial monomorphisms from X to Y. 
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Corollary 2. Let X and Y be attributed graphs. Then the best common subgraph problem 
is equivalent to the MWCP in Z = X oY . 

Proof. Proposition [T] and Theorem [T} □ 

Probabilistic Graph Matching Problem 

In probabilistic graph matching, a probability model is drawn to measure compatibility 
between items. The aim is to then find a morphism from X to Y that maximizes a global 
maximum a posteriori probability. To describe the probabilistic graph matching problem 
in formal terms, we first introduce a distinguished null color ey for vertices not contained 
in A. Next, we extend the model Y by including an isolated vertex with null color ey. 
The probabilistic graph matching problem is defined by 

maximize X, Y) = P(<f>\X, Y) (?) 

subject to <ft £ M.xy 

where the matching objective P((j)\X, Y) is the a posteriori probability of (j) given the 
measurements X and Y^ Applying Bayes Theorem, we obtain 

where P((f>) is the joint prior for <fi. The quantities p(X ,Y\(p) and p(X,Y) are the 
conditional measurement density and the probability density functions, respectively, for 
the sets of measurements. 

Probabilistic graph matching problems are usually solved by a Bayesian inference 
scheme that does not require explicit calculation of compatibility values in advance. 
Conceptually, there are compatibility values and a probabilistic graph matching problem 
turns out to be a GMP ([5| over the set of all total morphisms, where we extend the 
graph Y by an isolated vertex with void attribute ey. 

Corollary 3. The probabilistic graph matching problem of X and Y is equivalent to the 
MWCP in Z = XoY', where Y' extends Y by an isolated vertex with void attribute ey. 

Proof. Proposition [T] and Theorem [TJ □ 

Graph Edit Distance Problem 

The concept of graph edit distance generalizes the Levenshtein edit distance originally 
defined for strings [13]. The graph edit-distance is defined as the minimum cost over all 
sequences of basic edit operations that transform X into Y. Following common use, the 
set of basic edit operations are substitution, insertion, and deletion of items. Different 
cost functions can be assigned to each edit operation. 

The sequences of edit operations that make X and Y isomorphic can be identified with 
the partial monomorphisms cp : V{X) — ► V(Y). Each partial monomorphism induces 
a bijection (ft : T){(ft) i— > lZ((ft) from the domain T>(cj)) to the range TZ(4>) of cj>. In terms of 
(j), the edit operations have the following form 



In accordance with the terminology used in 21 we refer to X and Y as the sets of measurements. 
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• Substitution: An item i from T>((j)) is substituted by item from 7Z(4>). 

• Deletion: Items i of D{(j)) = I{X) \ T>(4>) are deleted from X. 

• Insertion: Items j from 1Z{4>) = I(Y) \ 1Z{4>) are inserted into Y. 

If X| = y^ , the substitution i by is called identical substitution. The cost of a partial 
monomorphism (j) is then defined by 

f(<P,X,Y)= C del {i)+ Y, dnsU)+ E C sub (i,i*), (9) 

iev(<t>) jell{(t>) ieT>{<f>) 

where Cd e i(i) is the cost of deleting item i of X, Ci ns {i) is the cost of inserting an item 
i into Y, and C su b(i,j) is the cost of substituting an item i from X by an item j of 1". 
We assume that all costs are nonnegative. 

The graph edit distance problem is then of the form 

minimize f (6, X, Y\ 

V ; (10) 

subject to 4> £ M. 

where the matching objective / is defined as in (|9| and M is the subset of all partial 
monomorphisms from XtoY. The constrained global maximum of — / is the graph edit 
distance of X and Y. 

To show equivalence between the graph edit distance problem and clique search, we 
transform problem (10) to our standard form of a GMP as given in ([5]). First, we expand 



the set A of attributes to A' = A U {d} by including the distinguished symbol d. We 
call items with attribute d dummy items. Next, we expand X and Y by adding dummy 
vertices. Suppose that X and Y are of order |X| = n and |y| = m, respectively. Insert 
m dummy vertices into X and n dummy vertices into Y . Connected each dummy vertex 
with all the other (original and dummy) vertices by dummy edges. Let X' and Y' be 
the resulting expanded graphs. We call X' and Y' the dummy extensions of X and Y. 
Finally, we define an appropriate compatibility function k. To this end, we first introduce 
some auxiliary notations. By V(X, a) we denote the subset of all vertices of X that have 
attribute a. Furthermore, let C' sub be a nonnegative real-valued function on I(X') x I(Y') 
such that 

C sub (i,j) : i e I(X), j e I(Y) 
: otherwise 



Now we consider the following compatibility values of X' and Y' 



Kij 



-Cdelii) 

-Ci ns (j) 



i G V(X), j £ V(Y', d) 

t€ V(X',d), j G V(Y) (11) 

otherwise 



for all items i of X' and j of Y' . Then the GMP over the set of all total monomorphisms 



from X' to Y' is equivalent to the graph edit distance problem (10). Note that we 
consider negative costs as compatibility values to turn the minimization problem (10) 
into a maximization problem as in 
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Corollary 4. Let X and Y be attributed graphs. Then the graph edit distance problem 
is equivalent to the MWCP in Z = X' oY' , where X' and Y' are the dummy extensions 
of X and Y. 

Proof. Proposition [T] and Theorem [TJ □ 

Solving the graph edit distance by clique search in Z = X' o Y 1 is impractical, because 
the dummy extensions double the total number of vertices of both original graphs. In 
a practical implementation, it is sufficient to expand each of both graphs X and Y by 
adding only one dummy vertex as described above. This leads to a formulation of the 
graph matching problem constrained over relations <p C V{X) x V(Y) rather than partial 
morphisms. Though the theory developed in this contribution also holds for relations, 
we do not consider the more general case for the sake of clarity. 

5 Conclusion 

We presented a necessary and sufficient condition (C) for graph matching to be equivalent 
to the maximum weight clique problem in an association graph. The implications of this 
result are as follows: first, equivalence proofs now reduce to showing that condition 
(C) holds; second, the condition (C) is applicable to a broad range of common graph 
matching problems; third, generic continuous solutions can now be applied to all graph 
matching problems that satisfy (C). 

We showed that the graph edit distance problem, probabilistic graph matching, the 
best common graph matching, and the maximum common homomorphic subgraph pro- 
blem are equivalent to clique search in a derived association graph. 

One limitation with this framework motivates further research: The maximum weight 
clique problem, where weights are assigned to both vertices and edges has been rarely 
studied in the literature. Hence, it is conducive to develop special continuous formulations 
and solutions to the maximum weight clique problem in order to obtain a generic graph 
matching solver. 
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